Accuracy of Memory Reference Traces of Parallel Computations in Trace-Driven Simulation
نویسندگان
چکیده
For given input, the global trace generated by a parallel program in a shared memory mul-tiprocessing environment may change as the memory architecture and management policies change. Consequently, if trace-driven simulation is used, care must be taken to adjust the global trace to reeect the reference pattern that would result from program execution in the new environment. Since the addresses may change as the environment changes, traditional process traces are not suucient. We propose a method for ensuring that a correct global trace is generated in the new environment. This method involves a new characterization of a parallel program that identiies its address change points and address aaecting points. An extension of traditional process traces, called the intrinsic trace of each process, is developed. The intrinsic traces maximize the de-coupling of program execution from simulation by describing the address ow graph and path expressions of each process program. At each point where an address is issued, the trace-driven simulator uses the intrinsic traces and the sequence of loads and stores before the current cycle , to determine the next address. The mapping between load and store sequences and next addresses to issue, sometimes, requires partial program reexecution. Programs that do not require partial program reexecution are called graph-traceable. We initially assume sequential consistency and then discuss the extension to the case where sequential consistency is violated.
منابع مشابه
Techniques for Cache and Memory Simulation Using Address Reference Traces
Simulation using address reference traces is one of the primary methods for the performance evaluation of the memory hierarchy of computer systems. In this paper we survey the techniques used in such a simulation. In both the uniprocessor and shared-memory multiprocessor cases, the issues can be divided into trace collection, trace storage, and trace usage. Trace collection can employ several h...
متن کاملPSnAP: Accurate Synthetic Address Streams through Memory Profiles
Memory address traces are an important information source; they drive memory simulations for performance modeling, systems design and application tuning. For long running applications, the direct use of an address trace is complicated by its size. Previous attempts to reduce address trace size incurred a substantial penalty with respect to trace accuracy. We propose a novel method of memory pro...
متن کاملExecution-Driven Simulation of Shared-Memory Multiprocessors
This paper describes an eecient execution-driven technique for the simulation of shared-memory multiprocessors driven by real programs. Our simulator ooers substantial advantages in terms of reduced time and space overheads when compared to instruction-driven or trace-driven simulation techniques, without signiicant loss of accuracy. The technique produces correctly interleaved address traces a...
متن کاملA Parallel Trace-driven Simulator: Implementation and Performance
The simulation of parallel architectures requires an enormous amount of CPU cycles and, in the case of trace-driven simulation, of disk storage. In this paper, we consider the evaluation of the memory hierarchy of multiprocessor systems via parallel trace-driven simulation. We re ne Lin et al.[10] original algorithm, whose main characteristic is to insert the shared references from every trace ...
متن کاملThe Inaccuracy of Trace-Driven Simulation Using Incomplete Multiprogramming Trace Data
Trace-driven simulation is commonly used to predict the performance of computer systems. However, existing tracing techniques produce traces inadequate for some studies: they do not usually record operating system references, and they produce relatively short traces. This paper explores the impact of these trace distortions on the performance estimates of unipro-cessor memory hierarchies using ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IEEE Trans. Parallel Distrib. Syst.
دوره 3 شماره
صفحات -
تاریخ انتشار 1992